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Abstract 

The problem of computing all maximal induced subgraphs of a graph G that have a graph 
property P, also called the maximal V '-subgraphs problem, is considered. This problem is studied 
for hereditary, connected-hereditary and rooted-hereditary graph properties. The maximal P- 
subgraphs problem is reduced to restricted versions of this problem by providing algorithms 
that solve the general problem, assuming that an algorithm for a restricted version is given. 
The complexity of the algorithms are analyzed in terms of total polynomial time, incremental 
polynomial time and the complexity class P-enumerable. The general results presented allow 
simple proofs that the maximal P-subgraphs problem can be solved efficiently (in terms of the 
input and output) for many different properties. 

1 Introduction 

Hereditary and connected-hereditary graph properties include many common types of graphs such 
as cliques, bipartite graphs and trees. Such properties appear in many contexts, and thus, they 
have been widely studied, e.g., [14, 20, 3, 17]. This paper focuses on the maximal V -subgraphs 
problem: Given a graph property P and an arbitrary graph G, find all maximal induced subgraphs 
of G that have the property P. We consider properties that are hereditary, connected hereditary 
or rooted hereditary (a variant of connected hereditary). 

Since the output for the maximal P-subgraphs problem may be large, our complexity analysis 
takes into consideration both the size of the input and the size of the output. Specifically, we 
consider the complexity measures total polynomial time [10] and incremental polynomial time [10], 
and the complexity class P-enumerable [19]. 

The maximal P-subgraphs problem has been studied for many properties P. For example, it 
has been shown that this problem is both P-enumerable and solvable in incremental polynomial 
time for the properties "is an independent set" and "is a clique" [10, 18, 1]. If subgraphs (and not 
only induced subgraphs) are allowed, then the maximal P-subgraphs problem is P-enumerable for 
the properties "is a spanning tree" [16, 13], "is an elementary cycle" [16, 9] and "is an elementary 
path" [16, 2], among others. 

This paper differs from previous work in that we do not consider specific properties, but instead, 
deal with the problem for a general P. Our strategy is to reduce the maximal P-subgraphs problem 
to restricted versions of the maximal P-subgraphs problem that are often easier to solve. Our 
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reductions are by means of algorithms that solve the maximal "P-subgraphs problem, given a solution 
to a restricted version of this problem. 

Using our algorithms, we can show that the maximal P-subgraphs problem can be solved in 
total-polynomial time, incremental-polynomial time or is P-enumerable if certain conditions hold on 
the runtime of a restricted version of this problem. Hence, our approach is easily shown to include 
and improve upon previous results in graph theory. For example, it is shown that the maximal 
"bipartite" -subgraph problem is solvable in total polynomial time, which improves upon [15]. 

The maximal P-subgraphs problem also has immediate practical applications in the database 
field. Interestingly, it turns out that many well-known semantics for answering queries in the 
presence of incomplete information can be modeled as hereditary, connected-hereditary or rooted- 
hereditary graph properties. Hence, the results in this paper imply and improve upon the com- 
plexity results in [11, 12, 4] and imply the complexity result in [21]. In fact, modeling semantics 
as graph properties allows previously presented semantics to be extended without affecting their 
complexity. See [5] for more details. 

2 Graphs and Graph Properties 

Graphs and Induced Subgraphs. A graph G = (V, E, r) consists of (1) a finite set of vertices 
V, (2) a set of edges E C V x V and (3) a root r such that reFU {-L}. We say that G is rooted 
if (1) r 7^ _L and (2) every vertex in G is reachable via a directed path from r. We say that G 
is connected if its underlying undirected graph is connected. Observe that every rooted graph is 
connected. However, a connected graph need not be rooted. We use V(G) to denote the set of 
vertices of G. 

A graph H is an induced subgraph of a graph G, written H C IS G, if (1) H is derived from G 
by deleting some of the vertices of G (and the edges incident on these vertices) and (2) H has the 
same root as G, if the root of G is among the vertices of H, and has _L as its root otherwise. We 
write H <Z m G if H C IS G and H is not equal to G. 

We use G[{vi, . . . , v n }] to denote the induced subgraph of G that contains exactly the vertices 
vi,..., v n . If H and H' are induced subgraphs of G and v is a vertex in G, we use G[H], G[H, v] and 
G[H,H'} as shorthand notations for G[V(H)], G[V(H) U {v}] and G[V(H) U V(H% respectively. 

Graph Properties. A graph property V is a nonempty and possibly infinite set of graphs. For 
example, "is a clique" is a graph property that contains all graphs that are cliques. In this paper, we 
only consider properties V such that it is possible to verify whether a graph G is in V in polynomial 
time. Hence, we assume that there is a polynomial procedure Sat(V) that receives a graph G as 
input, and returns true if G E V and false otherwise. Observe that the notation (V) denotes an 
algorithm that is parameterized by the graph property V, i.e., that differs for each value of V ■ 

We consider several special types of graph properties. A graph property V is hereditary if 
V is closed with respect to induced subgraphs, i.e., whenever G £ V, every induced subgraph 
of G is also in V . A graph property V is connected hereditary if (1) all the graphs in V are 
connected and (2) V is closed with respect to connected induced subgraphs. A graph property V 
is rooted-hereditary if (1) V only holds on rooted graphs and (2) V is closed with respect to rooted 
induced subgraphs. 1 It is rather unusual for a rooted- hereditary property to also be hereditary 
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or connected-hereditary. Actually, one can show that if V is rooted-hereditary and P is also 
hereditary or connected-hereditary, then V contains only graphs with at most one vertex. This 
gives an additional motivation to considering rooted-hereditary properties, since they generally 
differ from connected- hereditary and hereditary properties. 

Many graph properties are hereditary [8], e.g., "is a clique" and "is a forest." Note that "is a 
clique" is also connected hereditary. However, "is a clique" is not rooted hereditary, since it contains 
graphs that do not have roots. Some properties are connected hereditary, but not hereditary or 
rooted hereditary, such as "is a tree," which contains a graph G if the underlying undirected graph 
of G is a tree. Note that G is not necessarily rooted. Hence, "is a tree" is not rooted hereditary. 
The property "is a rooted clique" is rooted hereditary. 

The Maximal P-Subgraphs Problem. Let G be a graph and V be a property. (The graph G 
is not necessarily in P.) We say that H is a V-subgraph of G if H C IS G and H € P. The set of 
P-subgraphs of a graph G is denoted P(G). 

We say that H is a maximal V-subgraph of G if H is a P-subgraph of G and there is no P- 
subgraph H' of G, such that H C IS H' . We use V m {G) to denote the set of maximal P-subgraphs 
of G. The maximal V -subgraphs problem is: Given a graph G, find the set V m (G). 

3 Complexity Classes and Measures 

This paper explores the problem of computing V m (G), for a hereditary, connected-hereditary or 
rooted-hereditary property P and an arbitrary graph G. The maximal P-subgraphs problem cannot 
be solved in polynomial time, in the general case. This follows from the fact that sometimes the 
size of V m (G) is exponential in the size of G (see [3] for details). Hence, exponential time may be 
needed just to print the output. In this section, we discuss two complexity measures that are of 
interest when the output of a problem may be large: total polynomial time [10] and incremental 
polynomial time [10]. We also consider the complexity class P-enumerable [19]. 

A problem can be solved in total polynomial time, or PIO for short, if the time required to list 
all its solutions is bounded by a polynomial in n (the size of the input) and K (the number of 
solutions in the output). 2 For the maximal P-subgraphs problem, n is the number of vertices in G 
and K is the number of graphs in V m (G). 

The complexity class P-enumerable is more restrictive than the measure of total polynomial 
time. Formally, a problem is P-enumerable if the time required to list all its solutions is bounded 
by K times a polynomial in n. Note that P-enumerable differs from total polynomial time in 
that the factor of the output in the runtime must be linear. Since the size of the output may 
be exponential in the size of the input, the factor of output size in the total runtime is highly 
influential. 

Another complexity measure that is of interest when dealing with problems that may have large 
output (such as the maximal P-subgraphs problem) is incremental polynomial time. Formally, a 
problem is solvable in incremental polynomial time, or PINC for short, if, for all k, the A:-th solution 
of the output can be returned in polynomial time in n (the input) and k. Incremental polynomial 
time is of importance when the user would like to optimize evaluation time for retrieval of the first 
k maximal induced subgraphs, as opposed to optimizing for overall time. This is particularly useful 

2 This complexity measure is similar to polynomial time input-output complexity, which is commonly considered 
in database theory, e.g. [21]. 
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in a scenario where the user reads the answers as they are delivered, or is only interested in looking 
at a small portion of the total result. If a problem is solvable in total polynomial time, but not in 
incremental polynomial time, the user may have to wait exponential time until the entire output 
is created, before viewing a single maximal P-subgraph. 

Observe that every problem that is P-enumerable is also solvable in total polynomial time. 
Similarly, every problem that is solvable in incremental polynomial time is also solvable in total 
polynomial time. It is not known whether every problem that is solvable in incremental polynomial 
time is also P-enumerable, and vice-versa. The maximal P-subgraphs problem has been studied 
for many properties P. See Section 1 for several examples and see [6] for a listing of algorithms for 
combinatorial enumeration problems. 

4 Restricting the Maximal P-Subgraphs Problem 

Let V be a graph property. Suppose that we want to show that the maximal P-subgraphs problem 
is in PIO. To do this we must devise an algorithm that, when given any graph G, produces V m {G) 
in polynomial time in the input (i.e., G) and the output (i.e., V m (G)). For many properties V, it 
is difficult to find such an algorithm, since an arbitrary graph G must be dealt with. Our task of 
finding an appropriate algorithm is even more difficult if we actually want to show that the maximal 
P-subgraphs problem is P-enumerable or is in PINC. Hence, we focus on restricted versions of the 
maximal P-subgraphs problem. For these restricted versions, it is often easier to devise an efficient 
algorithm. Later on we will show how, given an algorithm for one of the restricted problems, the 
general problem can be solved. 

Let G be a graph and let P be a property. We use G — v to denote the induced graph of G 
that contains all vertices other than v. We say that G almost satisfies V if there is a vertex v in 
G, such that G — v € V. Let v' be a vertex in G. We use V m (G; v') to denote the subset of V m (G) 
that contains graphs with the vertex v'. 

We will be interested in three restricted versions of the maximal P-subgraphs problem. 

• The input-restricted maximal V-subgraphs problem is: Given a graph G that almost satisfies 
P, find all maximal "P-subgraphs of G. 

• The output-restricted maximal V-subgraphs problem is: Given an arbitrary graph G and a 
vertex v' in G, find all maximal P-subgraphs of G that contain v'. 

• The io-restricted maximal V-subgraphs problem is: Given a graph G that almost satisfies P 
and given a vertex v' in G, find all maximal P-subgraphs of G that contain v'. 

Note that the output-restricted maximal P-subgraphs problem can be used in a straightforward 
way to solve the maximal P-subgraphs problem. However, it is not clear how the other two problems 
can be used to solve the maximal P-subgraphs problem. 

The complexity of these three problems is highly dependent on the graph property P. Some- 
times, it turns out that the input-restricted maximal P-subgraphs problem and the io-restricted 
maximal P-subgraphs problem can actually be solved in polynomial time, since the number of 
graphs in their output is bounded in size by a constant. However, the fact that G almost satisfies 
V does not always entail that V m (G) is small. This is shown in the following example. 
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Figure 1: Finding Maximal Bipartite Subgraphs 

Example 4.1. Let V C mp be the connected- hereditary property that contains all connected bipartite 
graphs. Suppose that G is a graph that almost satisfies 7-cbip- It is not difficult to see that T J ^ 1V (G) 
contains at most three graphs. As an example, consider the graph G± in Figure 1. The graph G\ 
almost satisfies P C bip 5 since G\ — w is a connected bipartite graph. The set V^ IP (Gi) contains the 
following three graphs: (1) G\ — w, (2) G\ — U2 (derived by removing the neighbors of w on the 
bottom side) and (3) G\ — {i>i,i>2,i*i} (derived by removing the neighbors of w on the top side, 
and then removing unconnected vertices). 

Let Pbip be the hereditary property that contains all bipartite graphs. It is possible for the size 
of V^ P (G) to be exponential in the size of G, even if G almost satisfies P B ip- Consider, for example, 
the graph G2 in Figure 1. The graph G2 almost satisfies V B ip since G2 — w £ V B ip- However, the 
set V m {G2) contains 2 n + 1 graphs, i.e., G2 — w and the graphs derived by choosing the vertex w 
and one from each pair of vertices (vi,Ui), for all i. Notwithstanding the size of V m (G2), it is not 
difficult to show that the input-restricted 7-Bip-subgraphs problem is P-enumerable. To see this, 
observe that it is possible to find the graphs in V m {G2) efficiently, in terms of the input and the 
output, by dealing separately with each connected component of G2 — w. □ 

By formalizing the intuition presented in Example 4.1, the following propositions can be shown. 
Similar propositions can be shown for other graph properties. 

Proposition 4.2. The io-restricted V GmP - subgraphs problem is in PTIME. 

Proposition 4.3. The input-restricted V BlP - subgraphs is P-enumerable. 

5 Hereditary Properties 

In this section, we reduce the maximal 'P-subgraphs problem to the input-restricted maximal V- 
subgraphs problem for hereditary properties V . Our reduction is by means of an algorithm that 
shows how to compute V m {G) for an arbitrary graph G, given a procedure that can compute V m (G) 
for graphs G that almost satisfy V. 

In Figure 2, the algorithm GenHered(P) is presented. This algorithm uses the following two 
procedures. 

• Max (V)(H, G): This procedure receives graphs H and G as input and returns true if H is a 
maximal "P-subgraph of G and false otherwise. This procedure can easily be defined in terms 
of Sat(V), by (1) checking if H eP and (2) extending H with each vertex in G and checking 
whether any extension is in V (using Sat('P)). 
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Algorithm: GenHered('P) 

Input: Graph G = {{v\, ... ,v n },E,r) 

Output: Maximal P-Subgraphs of G, i.e., V m {G) 



1 g~{o } 

2 for i := 1 to n 

3 do U := Q 

4 H e HQ := Q - {H} 

5 H' G GenRestrHered('P) (G[_ff, if Mxx(V)(H',G[{vi, . . . ,Vi}]) 

6 then g := Q U {H'} 

7 return Q 



Figure 2: Algorithm to compute V m (G) for hereditary properties 

• GenRestrHered^)^): This procedure receives a graph H that almost satisfies V and 
returns the set V m (H). This procedure is not defined in this paper. Instead, it must be 
provided on a per-property basis. 

In essence, our algorithm reduces the maximal P-subgraphs problem to the input-restricted maxi- 
mal P-subgraphs problem by using the procedure GenRestrHered(;P). 

The algorithm GenHered('P} starts with the set Q = {Oo}, where 0$ is an empty graph, i.e., 
a graph with no vertices or edges. It then continuously (Line 6) attempts to extend each graph 
H in Q with an additional vertex v\. The graphs in V m (G[H, Vi]) that are maximal with respect 
to the vertices seen thus far are inserted into Q (Line 7) . This step is critical, since it (1) inserts 
graphs that are needed in order to create the final result and (2) avoids inserting extra graphs that 
would cause Q to grow exponentially. 

Let G be a graph with n vertices. Suppose that there are K graphs in V m (G). We show 
that GenHered(7 :, )(G) correctly computes V m {G) and analyze the runtime of our algorithm as a 
function of n and K. We use s-p(n) to denote the amount of time needed to check if G € V, i.e., the 
runtime of Sat(7 7 )(G). Observe that Max(V)(H,G) runs in 0(n s-p(n)) time. We use rp(n, K) to 
denote the amount of time needed to compute V m (G), when G almost satisfies V, i.e., the runtime 
of GenRestrHered^^G). Note that r-p is a function of both the input and the output. 

Theorem 5.1. Let V be a hereditary property and let G be a graph with n vertices. Let K be the 
number of graphs in V m (G). Then 

• GenHered(P)(G) =V m {G) and 

• GenHered(7 3 )(G) runs in time: 0(n 2 s v {n) K r v (n, K)) . 

Proof (Sketch). We use Gi to denote the induced subgraph of G containing exactly the vertices 
vi,...,Vi, i.e., G[{wi, . . . , Vi}]. We use Gi to denote the value of Q after % iterations of the loop in 
Line 2 of the algorithm. We show by induction on the number of vertices k in G that = V m (Gk). 
The inclusion Qk Q V m (Gk) can be shown by a case analysis of the lines in which graphs are added 
to (and removed from) Q^. The inclusion V m (G^) C Q k follows from the following inclusion: 

V m {G k ) C V m (G k ^) U ( U Kpm(Gt _ l} V m (G[H, v k })) . 
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The runtime follows from a careful analysis of the algorithm and from the fact that \V m (Gi-\)\ < 
\V m {Gi)\, for alii □ 



Corollary 5.2. Let V be a hereditary property. Then the maximal V-subgraphs problem is in PIO 
if and only if the input-restricted maximal V-subgraphs problem is in PIO. 

Corollary 5.3. Let V be a hereditary property. Then the maximal V-subgraphs problem is P- 
enumerable if the input-restricted maximal V-subgraphs problem is in PTIME. 

Corollary 5.4. The maximal V BIP - subgraphs problem is in PIO. 

6 Connected-Hereditary and Rooted-Hereditary Properties 

GenHered(P) may fail to return the correct graphs if V is connected hereditary or rooted heredi- 
tary. Intuitively, this failure is caused by the fact that an induced subgraph H may not be connected 
or rooted (and therefore, H g" V), even though there is a graph G such that H C IS G and G G V. 
In other words, the order in which we choose the vertices can effect the success of the algorithm. 

In this section, we solve the maximal P-subgraphs problem for connected-hereditary and rooted- 
hereditary properties in the following way: 

• The maximal P-subgraphs problem is reduced to the output-restricted maximal P-subgraphs 
problem in a straightforward fashion. 

• The output-restricted maximal 'P-subgraphs problem is reduced to the io-restricted maximal 
P-subgraphs problem by means of the algorithm GenWithVertex(P) (see Figure 3). The 
result of calling GenWith Vertex(P)(G, v r ), for an arbitrary graph G, is the set V m (G; v r ). 
Note that GenWith Vertex (P)(G, v r ) uses GenRestrWithVertex(P)(G, v r ) which gen- 
erates V m (G'; v r ) for graphs G' that almost satisfy V. 

In the remainder of this section, we explain the algorithm GenWith Vertex(P) — its notation, 
data structures and flow of execution. We show its correctness and analyze its runtime. 

Notation. Consider graphs G and H such that H C IS G. We say that a vertex v in V(G) — V(H) 
is an undirected neighbor of a vertex v' in H if either the edge (v, v') or the edge (v 1 , v) is in G. 
Similarly, we say that v G V(G) — V{H) is a directed neighbor of v' in H if the edge {v ' , v) is in G. 
Note that the neighbors (directed or undirected) of vertices in an induced subgraph are not in the 
induced subgraph. 

Given a property V, we use N-p(H,G) to denote the set of undirected neighbors of H if V is 
connected-hereditary and to denote the set of directed neighbors of H if V is rooted-hereditary. 
Note that we use V only in order to differentiate between undirected and directed neighbors. 

In our algorithm, a graph H C IS G (such that H G V) is associated with a set of vertices 
V(H). Intuitively, this set contains vertices v' G N-p(H, G) that cannot be used to extend H, since 
G[H,v'} g-p. 
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Data Structures. Gen With Vertex (V) uses two stacks to collect graphs: Stack\ and Stack2- 
Stacki contains graphs for which processing is incomplete. Therefore, a graph H will be in Stacki 
if it has a v' G N-p(H,G) that is not in V{H). For such a vertex v', it is not yet known whether 
v' can be added to H, i.e., whether G[H,v] G V. Stack2 contains graphs for which processing is 
complete. Therefore, a graph H will be in Stack2 if N-p(H, G) C V(ii'). 

To ensure that Stacki and Stack? contain the proper graphs, our algorithm uses the procedure 
PUSH APPROPRI ATE (7 3 )(i?,G, Stacks Stack 2 ), which does the following. If N V (H,G) £ V(H), 
then the procedure adds H to the top of Stacki. Otherwise, the procedure adds H to the top of 
Stack? ■ 

Flow of Execution. The algorithm Gen With Vertex (V) starts by considering the graph G[{v r }} 
Then, it continually extends graphs in Stacki with neighboring vertices to derive larger graphs that 
are in V. All extensions created must contain the vertex v r (so that we will only create graphs in 
V m (G; v r )). Suppose H G Stacki and v G N P (H,G). We deal with the case in which G[H,v] G V 
in Lines 9-10. We deal with the case in which G[H,v] G" V in Lines 11-25. 

Correctness and Runtime Analysis. The proof of correctness of Gen With Vertex (P) is 
rather intricate and has been omitted due to lack of space. However, we take note of the behavior 
of our algorithm that is critical in proving its correctness: 

• The fact that V(H) is assigned the empty set every time that a vertex is added to H, allows 
us to be prove that all graphs in V m (G; v r ) are returned. 

• Graphs G' from V m (G[H, v]; v r ) (see Line 12) are not immediately added to Stacki or Stack2- 
Instead we first try to combine G' with existing graphs in Stacki. We also check if G' is an 
induced subgraph of a graph in Stack 2 . Only if we have not succeeded in either of these 
actions, do we add G' to Stacki or Stack2, as appropriate. This prevents Stacki and Stack2 
from growing too big. It also ensures that only graphs from T jm (G; v r ) are returned, and 
each such graph is returned only once. 

In order to prove our complexity analysis of the runtime of GenWithVertex('P), we must show 
that GenRestrWith Vertex (V) does not create more graphs than the number of graphs in the 
result of GenWithVertex(7 7 ). This holds because we are able to prove that H C IS G implies that 
\V m (G[H]; v r )\ < \V m (G; v r )\ if either (1) V is connected-hereditary or (2) V is rooted-hereditary 
and v r is the root of G. 

Let G be a graph with n vertices. We use r!p(n,K) to denote the amount of time needed to 
compute V m (G; v) for a graph G that almost satisfies V and an arbitrary vertex v, i.e., the runtime 
of the procedure GenRestrWithVertex('P)(G', v). Note that r' v is a function of both the input 
n and the number of graphs in the output K. The function s-p is defined as before. 

Theorem 6.1. Let V be a connected-hereditary or rooted-hereditary property. Let G be a graph 
with n vertices, and let v r be a vertex in G. Suppose that G[{v r }] G V . Let K be the number of 
graphs in V m (G; v). Then 



• Gen With Vertex (V)(G,v r ) = V m (G; v r ) and 



• GenWithVertex(7 7 )(G, v r ) runs in time: 0(n 2 K 2 r' v (n,K)(s-p(n) +n 
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Algorithm: 

Input: 

Output: 



Gen With Vertex ("P) 
Graph G and Vertex v r 

Maximal answers that contain vertex v r , i.e., V m (G; v r ) 







1 V{G[{v r }]):=% 



2 


Stacki := 


3 


Stack2 := 


4 


PushAppropriate(:P}(G[{iv}], G, Stacki, Stack 2 ) 


5 


while Stacki ^ 


6 


do H := Stacki.POPQ 


7 


let v be a vertex in N V {H, G) - V(H) 


8 


if S at {V)(G[H, v}) 


9 


then V(G[H,v}) := 


10 


PushAppropriate('P) (G[i?, v] ) 


11 


else F(ff) := V(H) U {w} 


12 


G' G GENRESTRWlTHVERTEX(^)(G[ff,w],W r ) - {H}V(G') : = 


13 


inserted := false 


14 


H' e 5tacfci s.t. Sat{V) (G[G',H'})G new := G[G',H'] 


15 


V(G new ) := 


16 


Stacki.REMOVE(H') 


17 


PuSHAppROPRiATE(7 :> )(G„ eu ,, G, Stacki, Stack2) 


18 


inserted := true 


19 


if exists H 1 e Stack 2 s.t. V{G') C V(if') 


20 


then inserted := true 


21 


if not (inserted) 


22 


then PushAppropriate(7 ;> )(G / , G, Stacki, Stack 2 ) 


23 


PuSHAppROPRiATE(7 : ')(f/', G, Stacki, Stack 2 ) 



24 return Stack 2 



Figure 3: Algorithm to compute V m (G; v r ) for connected-hereditary or rooted-hereditary properties 

Theorem 6.2. Let V be a connected-hereditary or rooted-hereditary property. The maximal V- 
subgraphs problem is in PIO if the io-restricted maximal V -subgraphs problem is in PIO. 

Proof (Sketch). Since any vertex in G may or may not appear in a solution of a connected- 
hereditary property, it is possible to compute V m (G) for connected- hereditary properties by calling 
GenWithVertex('P) for every vertex in G. If V is a rooted-hereditary, every graph in V m (G) must 
contain the root of G. Hence, it is possible to compute V m (G) by calling GenWithVertex('P) 
with the root of G. □ 

If V is rooted hereditary and v r is the root of G, then V m {G; v r ) = V m [G). Hence, we can 
show the following result. 

Corollary 6.3. Let V be a rooted-hereditary property. The maximal V-subgraphs problem is in 
PIO if and only if the input-restricted maximal V-subgraphs problem is in PIO. 
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7 Extending the Algorithms 



In this section we discuss some small changes that can be made to the algorithms GenHered(P) 
and Gen With Vertex (V) in order to improve the complexity results from the previous sections. 

P-Enumerable for Connected-Hereditary and Rooted-Hereditary Properties. In Corol- 
lary 5.3, we presented a sufficient condition for the maximal "P-subgraphs problem to be P- 
enumerable, for hereditary properties V. The algorithm Gen With Vertex (V) cannot be used in 
order to derive a sufficient condition for this problem to be P-enumerable for connected-hereditary or 
rooted-hereditary properties, since K appears quadratically in the runtime of GenWithVertex("P). 

It turns out that one can adapt GenHered('P) to derive an algorithm that computes V m (G; v r ), 
for a rooted-hereditary or connected-hereditary property V, provided that certain conditions hold. 
The crux of the change to GenHered('P) is in careful choice of the order in which to iterate over 
the vertices in G. The adapted algorithm can be used similarly to GenWithVertex('P) in order 
to compute V m {G). 

Theorem 7.1. Suppose that the input-restricted maximal V-subgraphs problem is in PTIME. Then, 
the maximal V-subgraphs problem is P-enumerable if (1) V is rooted- hereditary and G is acyclic or 
(2) V is connected-hereditary and the underlying undirected graph of G is a tree. 

Incremental Polynomial Time. None of the complexity results presented have provided con- 
ditions for the maximal P-subgraphs problem to be solvable in incremental polynomial time. By 
slightly changing the procedure GenWithVertex(P) we can derive an algorithm that computes 
V m (G; v r ) in incremental polynomial time for an important special case. Using this adapted algo- 
rithm, V m (G) can also be computed in incremental polynomial time, for connected- hereditary and 
rooted-hereditary properties. Our adapted algorithm can also be used for a hereditary property 
V, by reducing V to an appropriately defined rooted-hereditary property. In addition, we derive a 
polynomial complexity result for returning k maximal induced subgraphs, for any constant k. 

Theorem 7.2. Let V be hereditary, connected-hereditary or rooted-hereditary property. Sup- 
pose that the io-restricted maximal V-subgraphs problem is in PTIME. Then, (1) the maximal 
V-subgraphs problem is in PINC and (2) k graphs from V m (G) can be returned in polynomial 
time, for any constant k. 

Proof (Sketch). Item 2 follows directly from Item 1. To show Item 1, let G be a graph with n 
vertices. By careful observation, one may note that after at most n 2 iterations of the loop in Line 
5 of GenWithVertex(P), an additional graph will be in Stack2- One can take advantage of this 
fact to adapt Gen With Vertex (V) so that it will run in incremental polynomial time, by having 
PushAppropriate('P) print graphs as it adds them to Stack2- (Care has to be taken not to print 
graphs that appeared before in a previous call to GenWithVertex('P).) □ 

Corollary 7.3. The maximal V C bip problem is in PINC. 

8 Conclusion 

This paper reduces the maximal P-subgraphs problem to restricted versions of the same problem by 
providing algorithms that solve the general problem, assuming that an algorithm for a restricted 
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version is given. Our results imply that when attempting to efficiently solve the maximal V- 
subgraphs problem, it is not necessary to define an algorithm that works for the general case. 
Instead, an algorithm for restricted cases must be defined. An efficient method for solving the 
maximal "P-subgraphs problem for the general case is automatically derived from our algorithms. 

Sometimes it turns out that algorithms for restricted cases of the maximal P-subgraphs problem 
are straightfoward. For example, this is the case with the properties P B i P and Pcbip- There are 
additional properties for which this holds, e.g., the set of independent sets, the set of star graphs, 
etc. Thus, our results immediately imply that the maximal "is an independent set" -subgraphs 
problem is both P-enumerable and in PINC, and the maximal "is a star graph" -subgraphs problem 
is in PINC. Note that it is significantly easier to come up with algorithms that solve the restricted 
versions of these problems than to come up with algorithms that solve the general cases. 

Interestingly, our results can be applied to the database problem of computing maximal query 
answers. Well-known semantics for this problem, e.g., full disjunctions [7], can be modeled as graph 
properties. It is often easy to define algorithms that solve the restricted versions of the maximal P- 
subgraph problem, for graph properties that correspond to semantics for incomplete information. 
Hence, the results in this paper have immediate practical applications for efficiently computing 
maximal query answers. 
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